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PREFACE 


Experimentation  in  the  Defense  RDT&E  On-Line  System  (DROLS)  with  free 
text  searching  extends  back  to  December  1979  when  preliminary  testing 
was  conducted  in  the  Technical  Reports  Data  Base  (TR).  Once  the 
capability  was  explored  in  the  TR  system,  it  was  decided  to  apply  and 
test  the  technique  on  the  Research  and  Development  Program  Planning 
Data  Base  (R&DPP),  the  Research  and  Technology  Work  Unit  Information 
System  Data  Base  (WUIS),  and  the  Independent  Research  and  Development 
Data  Base  ( IR&D) .  These  data  bases  provide  technical  and  fiscal 
information  about  Department  of  Defense  related  R&D  programs,  so  within 
DTIC,  they  are  generally  referred  to  as  the  management  data  bases. 

The  Directorate  of  Data  Systems  in  cooperation  with  the  Office  of 
Information  Science  and  Technology  implemented  and  tested  free  text 
searching  in  the  management  data  bases.  This  report  was  prepared  in  an 
effort  to  describe  the  implementation  of  free  text  and  the  test 
resul  ts. 
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INTr  AUCTION 


In  an  effort  to  upgrade  technical  information  services  to  the  Defense 
research  and  development  community,  the  Defense  Technical  Information 
Center  (DTIC)  initiated  a  project  to  study  other  DoD,  federal  and 
commercial  on-line  systems  capabilities  during  1979.  One  of  the 
common  capabilities  among  the  systems  evaluated  was  the  ability  to 
retrieve  information  via  free  text  terms.  The  idea  that  free  text 
searching  is  an  important  feature  to  be  considered  in  requirements  for 
future  systems  is  supported  in  Information  Retrieval  On-Line  by 
Lancaster  &  Fayen.V 

Future  systems  should  be  oriented  toward  natural 
language  (e.g.,  of  abstracts)  rather  than 
controlled  vocabularies.  Specialized  information 
systems  will  require  the  specificity  of  natural 
language,  and  the  full  interactive,  browsing 
capabilities  of  such  systems  will  be  exploited 
more  efficiently  if  they  include  relatively 
substantial  portions  of  natural  language  text. 

For  the  practitioners  in  a  field,  natural  language 
is  almost  essential.  We  cannot  expect  such  users 
to  learn  the  nuances  and  idiosyncracies  of  a  large 
controlled  vocabulary  based  on  years  of  indexing 
practice  and  protocol.  In  addition  to  being 
highly  desirable,  natural  language  systems  are 
becoming  increasingly  feasible  because  of  the 
increasing  availability  of  machine-readable  data 
bases  as  by-products  of  other  operations,  and 
because  of  the  fact  that  digital  storage  costs  are 
coming  down  and  are  likely  to  come  down  much 
further  with  the  development  of  mass  storage 
devices  of  various  types. 

During  1980  through  1981  the  Office  of  Data  Systems  in  cooperation 
with  Office  of  Information  Science  and  Technology  developed  the 
capability  of  free  text  searching  in  the  Defense  RDT&E  On-line 
System  (DROLS).  The  purpose  of  this  paper  is  to  describe  the 
implementation  and  testing  of  free  text  searching  in  the  management 
data  bases. 


1.  F.W.  Lancaster  and  E.G.  Fayen,  I nf ormati on  Retr i eval  On- 1 i ne 
(Los  Angeles,  California:  John  Wiley  &  Sons,  Inc.,  1973),  p.  413. 
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"Free  text  searching"  for  the  DTIC  experiment  is  defined  as:  single 
terms  or  words  that  have  been  extracted  from  narrative  portions  of  the 
data  base  that  can  be  retrieved  directly  from  the  inverted  file.  In 
other  online  systems,  such  as  the  National  Library  of  Medicine/MEDLARS 
or  the  Systems  Development  Corporation/ORBIT”,  "free  text"  may  be 
referred  to  as  full  text,  natural  language,  text  words,  or 
uncontrolled  terms.  The  information  community  has  not  yet  chosen  a 
standard,  so  for  purposes  of  this  paper  the  phrase  "free  text"  will  be 
empl  oyed . 

BACKGROUND 

Searching  by  free  text  is  very  similar  to  searching  by  descriptors  or 
open-ended  terms.  Conceptual  analysis,  or  breaking  the  search 
question  into  concepts,  is  required.  The  major  difference  between 
controlled  vocabulary  searching  and  free  text  searching  is  the 
searcher  must  decide  which  of  the  variant  text  words  are  likely  to 
distinguish  relevant  documents  from  irrelevant  ones.  In  some  ways 
free  text  searching  is  more  difficult  than  controlled  vocabulary 
searching  because  much  of  the  burden  of  term  selection  is  placed  on 
the  searcher;  however,  it  offers  a  number  of  benefits  which  otherwise 
would  not  be  realized.  For  instance,  it  is  possible  to  look  for 
documents  with  individual  personal  or  product  names.  "Planck 
constant",  "maxwell's  rule",  or  "Pratt  and  Whitney"  are  examples  of 
searches  that  would  be  easily  handled  by  free  text.  It  is  also 
possible  to  reduce  a  search  to  a  smaller  number  of  elements  than  in  a 
controlled  vocabulary  search.  For  example  the  free  text  term  "clams" 
instead  of  the  descriptor  "shellfish"  is  a  much  narrower  concept; 
therefore,  a  smaller  portion  of  the  file  must  be  reviewed.^/ 

For  further  clarification  on  the  capabilities  of  free  text  searching, 
see  the  text  word  searching  rules  in  appendix  A. 

Preliminary  testing  was  conducted  with  the  Technical  Reports  Data 
Base.  Based  on  frequency  counts  and  retrieval  experience  gained 
during  the  preliminary  tests,  a  DTIC  stopword  list  and  free  text 
searching  rules  were  developed.  (See  appendix  A  and  appendix  B.)  For 
further  information  on  the  Technical  Reports  test  see  DTIC  Free  Text 
Experiment  -  Technical  Reports  File. 3/ 

The  free  text  programs  were  changed  to  process  text  for  each  of  the 
management  data  bases  and  the  on-line  system  programs  were  modified  by 
adding  a  new  role  code  (60)  so  free  text  terms  could  be  isolated  from 
other  inverted  file  terminology.  To  retrieve  free  text  terms  the  role 
code  must  be  used;  that  is,  they  are  left  out  of  the  search  unless 


2.  F.W.  Lancaster,  Information  Retrieval  Systems  (New  York: 
John  Wiley  &  Sons,  Inc.,  1979),  pp.  270-292. 

3.  J.L.  Carney  et  al . ,  DTIC  Free  Text  Experiment  -  Technical 
Reports  File  (Alexandria,  Va.:  Defense  Technical  Information  Center, 
1961),  pp.  1-7. 
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specifically  requested.  Once  these  modifications  were  made,  free  text 
searching  was  implemented  consecutively  in  each  of  the  management  data 
bases:  Research  and  Technology  Work  Unit,  Research  and  Development 
Program  Planning,  and  Independent  Research  and  Development. 


EVALUATION  AREAS 

There  were  four  questions  that  were  of  concern  while  implementing  free 
text  searching  in  the  managment  data  bases: 

1)  How  will  relevance  of  retrieval  be  affected  by  free  text? 

2)  Will  the  number  of  records  retrieved  be  significantly  larger? 

3)  How  much  larger  will  the  management  data  base  inverted  files 
be? 

4)  How  will  computer  response  time  be  affected  by  the  larger 
free  text  inverted  files? 

Each  of  the  questions  will  be  discussed  separately  in  the  results 
section. 


TEST  CONSTRUCTION 


To  test  free  text  searching  in  the  management  data  bases,  29  descrip¬ 
tors  from  the  DTIC  Retrieval  and  Indexing  Terminology  (DRIT)V  were 
randomly  selected  and  converted  to  a  free  text  searching  format.  For 
example,  the  index  term  RADAR  CAMOUFLAGE  would  be  searched  in  the  free 
text  format  as  well  as  the  descriptor  format: 


Descriptor  Strate gy 


oswuo 

RADAR  CAMOUFLAGE 
END 


(COMMAND) 

(DESCRIPTOR) 

(COMMAND  TERMINATOR) 


Free  Text 


@SWU@ 

760RADAR 

AND 

?60CAM0UFLAGE 

END 


(COMMAND) 

(ROLE  CODE  and  SEARCH  TERM) 
(BOOLEAN  OPERATOR) 

(ROLE  CODE  and  SEARCH  TERM) 
(COMMAND  TERMINATOR) 


4.  U.S.,  Department  of  Defense,  Defense  Technical  Information 
Center,  DTIC  Retrieval  and  Indexing  Terminology:  Posting  Terms  with 
Hierarc 


In  addition,  a  comprehensive  search  strategy  was  constructed  that 
would  retrieve  descriptors  as  well  as  free  text  terms. 

Combination  Strategy 


@SWU@ 

RADAR  CAMOUFLAGE 

760RADAR 

AND 

RADAR  CAMOUFLAGE 

760CAM0UFLAGE 

END 


(COMMAND) 

(DESCRIPTOR) 

(ROLE  CODE  and  SEARCH  TERM) 
(BOOLEAN  OPERATOR) 
(DESCRIPTOR) 

(ROLE  CODE  and  SEARCH  TERM) 
(COMMAND  TERMINATOR) 


It  should  be  noted  that  the  descriptor  appears  on  each  level  because 
DROLS  does  not  allow  the  use  of  parenthesis  in  Hs  Boolean  logic. 
Putting  descriptors  on  each  level  is  a  forced  "OR"  condition. 

The  following  venn  diagrams  will  clarify  the  logic.  For  searching 
the  concept  RADAR  CAMOUFLAGE  both  the  descriptor  and  the  free  text 
terms  are  of  interest.  We  want  to  retrieve  the  descriptor  and  the 
conjunction  of  text  words  RADAR  and  CAMOUFLAGE. 

(a)  (b)  (c) 

RADAR  CAMOUFLAGE  OR  RADAR  AND  CAMOUFLAGE 


Without  the  parentheses  the  results  would  be: 

(a)  (b)  (c) 

RADAR  CAMOUFLAGE  OR  RADAR  AND  CAMOUFLAGE 


a 


because  the  "OR"  operation  is  done  first  using  Boolean  logic. 
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Using  parentheses  would  allow  the  searcher  to  signal  which  operation 
is  to  take  precedence. 

(a)  (b)  (c) 

RADAR  CAMOUFLAGE  OR  (RADAR  AND  CAMOUFLAGE) 


a 


The  "AND"  opertation  is  done  before  the  "OR"  operation. 

To  achieve  the  same  results  using  DROLS  logic  the  descriptor  must  be 
combined  with  each  of  the  free  text  term  by  a  Boolean  "OR". 

(a)  (b)  (a)  (c) 

RADAR  CAMOUFLAGE  OR  RADAR  AND  RADAR  CAMOUFLAGE  OR  CAMOUFLAGE 

a  a 


The  or  operations  are  done  first,  and  then  the  results  are  combined. 

Once  the  three  types  of  search  strategies  (descriptor,  free  text,  and 
combined)  were  constructed,  they  were  run  against  the  three  management 
data  bases  and  statistics  were  collected.  For  purposes  of  this  paper, 
discussion  of  the  results  will  be  generally  limited  to  the  Work  Unit 
Information  System.  The  Work  Unit  system  is  the  largest  of  the 
management  data  bases  and  it  is  felt  to  be  representative  of  the 
others. 


The  test  searches  were  run  periodically  between  April  22  and 
June  23,  1981  in  order  to  test  free  text  searching  during  a  variety  of 
conditions.  The  chart  in  appendix  C  gives  a  listing  of  Work  Unit 
search  dates  and  under  what  conditions  the  tests  were  conducted. 


RESULTS 

HOW  MUCH  LARGER  WILL  THE  MANAGEMENT  DATA  BASE  INVERTED  FILES  BE? 

The  following  table  lists  the  number  of  disk  tracks  that  the  managment 
data  bases  occupied  prior  to  the  implementation  of  free  text  searching 
and  the  number  they  now  occupy.  It  should  be  noted  that  file  updates 
may  have  occured  during  the  test  period  and  could  have  an  effect  on 
the  file  size. 

INVERTED  FILE  SIZES 


March  25,  1981 
(before  text 
inversion) 

June  24,  1981 
(after  text 
inversion) 

^Projection 

Work  Unit 

6,399 

13,969 

12,399 

Program  Planning 

1,151 

3,968 

2,301 

Independent  Research 

1,555 

4,736 

2,905 

Total s 

9,105 

22,400 

17,605 

Number  of  tracks  per  disk  pack  are  approximately  19,000. 


The  new  inverted  files  are  much  larger  than  the  previous  ones.  For 
comparison,  the  old  files  resided  on  9,105  disk  tracks  or  less  than 
half  a  disk  pack  and  the  new  free  text  files  reside  on  22,400  disk 
tracks  or  just  over  one  disk  pack.  Projections  had  been  made  for  the 
management  data  base  file  sizes  in  January  1981  based  on  the  Technical 
Reports  experimentation.  (See  appendix  0.)  These  estimates  were  just 
under  the  actual  implementation  inverted  file  sizes.  The  narrative 
portions  of  the  management  data  bases  are  slightly  longer  than  the 
technical  report  title  and  abstract;  consequently,  their  estimated 
file  sizes  were  a  little  conservative. 


■k 


Projection  based  on  Technical  Reports  experiment.  See  appendix  0. 


HOW  WILL  COMPUTER  RESPONSE  TIME  BE  AFFECTED  WITH  THE  LARGER  INVERTED 
FILES? 

Each  of  the  29  Work  Unit  Information  System  test  searches  was  run  15 
times  under  a  variety  of  conditions.  For  each  search  the  computer 
time  was  noted.  (See  appendix  E.) 

Each  run  of  the  searches  was  characterized  by  two  parameters  -  type  of 
search  (combined,  descriptor,  or  free  text)  and  the  operating 
condition  under  which  the  search  was  conducted  (normal  search  in  the 
online  system  with  normal  searching  background  prior  to  free  text 
inversion  =  N,  test  system  search  in  the  test  online  system  with  a  two 
terminal  searching  background  and  the  free  text  terms  were  included  = 
T,  or  normal  search  in  the  online  system  with  normal  searching 
background  and  free  text  terms  were  included  =  P).  The  computer  times 
are  cumulated  and  averaged  at  the  bottom  of  the  chart.  The  cumulated 
times  are  graphed  for  comparison  in  appendix  f.  It  was  expected  that 
the  Test  (T)  searches  would  take  less  time  than  the  Production  (P) 
searches.  The  online  system  was  handling  a  normal  workload  during  the 
production  phase  and  only  one  or  two  terminals  during  the  test  phase 
of  implementation.  (For  search  dates  refer  to  appendix  c.)  The  run 
labeled  "RUS",  to  the  far  right  in  appendix  E,  is  an  extra  free  text 
run  that  had  a  date  range  added  to  limit  the  search  results.  It  has 
been  included  here  to  show  how  an  additional  boolean  operator  can  add 
to  the  computer  time. 

The  conclusion  that  can  be  drawn  from  appendix  F  is  that  it  takes 
significantly  more  computer  time  to  search  with  free  text  terms. 

There  are  several  reasons  for  the  longer  time: 

a.  There  are  more  documents  retrieved;  therefore,  the  computer 
has  more  information  to  manipulate.  (The  next  section  will  explore 
this  in  more  detail .) 

b.  In  free  text  searching,  a  number  of  boolean  operators  must  be 
used  to  equate  a  single  level  search  or  a  multi-word  descriptor. 
Combining  two  free  text  words  will  take  the  computer  longer  to  perform 
than  the  table  look  up  for  a  descriptor. 

c.  The  masking  or  truncation  capability  is  used  more  frequently 
in  free  text  searching  and  requires  more  computer  time. 

d.  For  a  comprehensi ve  search  the  descriptors  must  be  added  on 
each  boolean  level.  This  requires  additional  time  for  the  computer  to 
merge  the  results. 

WILL  THE  NUMBER  OF  RECORDS  RETRIEVED  BE  SIGNIFICANTLY  LARGER? 

As  each  test  search  was  completed  the  number  of  documents  retrieved 
was  noted.  (See  appendix  G.)  The  structure  of  the  chart  is  the  same 
as  described  above  in  the  computer  response  time  section.  The  number 
of  documents  retrieved  per  search  run  are  graphed  for  comparison  in 
appendix  H.  The  slight  variation  among  groups  is  due  to  file  updates. 
As  can  be  seen  in  the  graph,  free  text  searching  gives  over  a  100 
percent  increase  in  recall  for  the  combined  searches. 
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HOW  WILL  RELEVANCE  OF  RETRIEVAL  USING  FREE  TEXT  BE  AFFECTED? 


To  gather  documents  for  relevance  review,  a  free  text  search  was 
conducted  with  a  date  range  added  to  limit  search  results.  These 
searches  are  labled  "RUS"  on  the  previous  charts.  The  results  were 
ordered  for  20  searches  and  ten  documents  of  each  were  examined  for 
rel evance. 

There  are  many  problems  with  using  relevance  as  an  evaluation 
criteria.  Relevance  judgments  are  subjective  and  therefore  people 
will  disagree.  Many  authors  have  faced  this  problem  and  Donald  W. 
King  addressed  it  in  the  1968  Annual  Review  of  Information  Science  & 
Technology. 

The  reports  also  reveal  the  hazards  of  using 
relevance  assessments  blindly.  The  prescriptions 
are  given  for  avoiding  such  pitfalls,  however,  and 
one  must  conclude  that  since  relevance  is  a  concept  that 
cannot  be  avoided,  it  must  be  used.  However,  it  is 
comforting  to  know  that  one  can  now  properly  feel 
nervous  in  using  it .£/ 

For  purposes  of  this  study  documents  were  judged  in  one  of  three 
categories,  relevant,  marginally  relevant,  or  not  relevant,  based  on 
the  following  criteria. 

A.  Documents  which  could  relate  to  the  topic  directly  or  from 
which  the  answer  could  be  inferred. 

B.  Documents  which  directly  related  to  the  topic. 

C.  Documents  which  contained  ideas  or  facts  that  could  be  useful 
in  considering  the  topic. 

The  subject,  the  document  numbers,  and  relevance  standing  for  each  of 
the  twenty  searches  are  listed  in  appendix  I.  In  addition,  the 
documents  are  noted  under  the  column  labeled  "match"  if  it  would  have 
fallen  in  a  descriptor  search.  The  results  are  summarized  under 
comments. 

The  free  text  relevance  judgements  are  plotted  in  appendix  J  for 
comparison.  The  first  column  represents  the  cumulative  total  of  the 
200  documents  reviewed.  If  the  20  searches  had  been  conducted  by 
descriptors  alone  only  23  percent  of  the  documents  would  have  been 
retrieved. 


5.  D.W.  King,  "Design  &  Evaluation  of  Information  Systems"  in 
Annual  Review  of  Information  Science  &  Technology,  vol  .  3,  C.  Cuadra, 
ed.,  (Chicago:  Encyclopaedia  Britannica,  1§66),  pp.  61-103. 
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The  more  important  question  of  relevance  is  plotted  in  the  next  three 
columns.  For  the  200  documents  retrieved:  56.5  percent  were  judged 
relevant,  26.5  percent  were  judged  marginally  relevant,  and  17  percent 
were  judged  not  relevant.  Descriptor  recall,  for  the  200  test 
documents,  is  plotted  on  top  of  the  free  text  results.  For  the 
documents  judged  to  be  relevant  only  20  percent  would  have  been 
retrieved  using  descriptors  only.  It  should  be  noted  that  the  high 
rate  of  false  drops  in  the  not  relevant  column  was  for  the  most  part 
due  to  search  construction  problems.  Searcher  experience  can 
significantly  reduce  the  false  drop  rate. 


CONCLUSIONS 

Free  text  searching  has  been  very  successful  in  the  DR0LS  management 
data  bases.  It  had  been  feared  that  the  much  larger  inverted  files 
would  seriously  affect  response  time.  In  appendix  E,  the  search 
labeled  "WHI"  was  run  after  free  text  had  been  implemented  in  all  the 
management  data  bases,  and  the  new  feature  had  been  made  available  to 
the  entire  online  community.  The  response  time  for  this  search,  as 
compared  to  the  other  combined  searches,  took  no  longer.  The 
background  of  larger  inverted  files  and  normal  use  seems  to  have 
little  effect  on  the  computer  response  time.  While  a  combined  search 
takes  longer  than  a  descriptor  search,  it  makes  very  little  difference 
to  the  user  who  does  not  wish  to  retrieve  free  text  terms. 

The  primary  advantage  of  using  free  text  searching  is  access  to 
relevant  documents  that  have  been  previously  impossible  or  difficult 
to  locate.  As  searchers  gain  experience  in  the  use  of  free  text 
searching,  results  can  be  limited  to  a  much  more  relevant  set  of 
documents  than  has  been  possible.  This  study  did  not  use  the  string 
searching  (text  search)  capability  that  has  been  available  to  DR0LS 
users  for  some  time,  however,  the  combination  of  free  text  inverted 
file  searching  and  string  searching  of  the  direct  file  can  greatly 
narrow  results  to  a  few  highly  relevant  documents.  It  should  be  noted 
that  some  relevant  documents  may  be  missed  using  the  combined  methods, 
because  authors  may  not  use  the  precise  phrases  the  searcher  selects. 


RECOMMENDATIONS 
It  is  recommended: 

That  the  free  text  searching  capability  be  maintained  in  the 
management  data  bases  after  the  six  month  trial  period. 

That  the  user  community  be  carefully  trained  to  take  advantage  of  the 
benefits  and  to  avoid  the  pitfalls  of  free  text  searching. 

That  the  free  text  searching  capablilty  be  expanded  to  the  Technical 
Reports  Data  Base  if  the  mass  storage  is  available. 
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Appendix  A 


TEXT  WORD  SEARCHING  IN  THE  MANAGEMENT  DATA  BASES 


INTRODUCTION 


The  Text  Word  Searching  capability  allows  the  user  to  directly  search  for 
those  records  in  the  management  data  base  inverted  files  containing 
specific  words  or  phrases. 

VOCABULARY 

1.  All  single  words  from  the  title  and  narrative  fields  are  posted  to  the 
inverted  file  (except  stop  words).  All  the  words  following  the  example 
narrative  will  be  posted  to  the  inverted  file. 

EXAMPLE: 

This  paper  proposed  a  waveform-agile  radar,  analyzes  the 
signal-processing  required,  and  describes  the  state  of  the  art  of 
two  new  acusto-electric  devices  needed  for  the  agile  radar:  the 
memory  correlator  (Model  R-10),  which  provides  electronically 
programmable  matched  filtering  of  individual  pulses,  and  the  coherent 
Integrator,  which  is  being  developed  to  provide  electronically 
programmable  doppler  processing.  The  electronic  programmability  of 
these  compact  wideband  analog  devices  gives  them  unique  functional 
capabilities  which  may  allow  future  tactical  systems  to  achieve 
electronic  counter  measure  resistance  and  low  probability  of  intercept 
by  facilitating  the  use  of  continually  changing  wideband  waveforms. 


10 

electronic 

processing 

achieve 

electronically 

programmability 

acusto 

facilitating 

programmable 

agile 

filtering 

proposes 

allow 

functional 

pulses 

analog 

future 

r 

analyzes 

gives 

radar 

art 

individual 

resistance 

capabilities 

integrator 

signal 

changing 

intercept 

state 

coherent 

low 

systems 

compact 

matched 

tactical 

continually 

measure 

them 

correlator 

memory 

two 

counter 

model 

unique 

developed 

needed 

waveform 

devices 

new 

waveforms 

doppler 

electric 

paper 

probability 

wideband 

2.  All  stop  words  are  deleted  from  the  narrative.  Attachment  1  is  the 
stop  word  list  composed  and  used  by  DTIC. 


Appendix  A  (cont.) 

3.  Text  words  allow  the  searcher  to  find  highly  relevant  summaries  if 
used  properly.  Do  not  search  text  words  for  dogs  or  dog  if  you  are 
looking  for  summaries  on  German  Shepherds.  Use  the  specific  terminology 
German  and  Shepherds . 

4.  Highly  posted  terms  should  never  be  used  as  the  first  search  term  in 
your  search  strategy.  Helicopters  and  Design  is  more  efficient  than  Design 
and  Helicopters  because  design  is  a  very  highly  posted  term  and  the 
computer  has  fewer  items  to  examine  if  the  smaller  term  Helicopters  is  used 
first. 

5.  When  using  the  text  searching  capability,  some  terms  may  result  in  finds 
of  more  than  the  25,000  limit.  Use  of  accession  number  ranging  would 
eliminate  the  problem  of  an  incomplete  search. 

TEXT  WORD  SEARCH  STRATEGY  FORMULATION 

1.  Text  words  are  searched  using  role  code  60.  Text  words  will  be 
excluded  in  normal  subject  searches. 

EXAMPLE: 

TELECOMMUNICATIONS  will  retrieve  everything  but  text  words 
?60TELEC0MMUNI CATIONS  will  retrieve  only  text  words 

2.  The  maximum  length  of  a  text  word  search  term  is  60  characters. 

3.  Text  words  may  be  truncated  to  pick  up  any  words  with  the  same  word 
root.  This  option  should  be  used  carefully  to  guard  against  too  many  false 
finds. 


EXAMPLE: 


?60%DOG  will  retrieve  dog  and  dogs  along  with  dogmatic ,  dogma ,  etc. 


4.  Blanks  and  special  characters 
characters  are  treated  as  blanks. 

EXAMPLES: 

Narrative  Phrase 

2,  4-D 

AN/800-1 

F-15 

TF30-P-3 

Cobal t-Alloyed  Silicon 
1 ,000  Ton  Trucks 


a**e  term  separators.  All  special 
There  are  no  exceptions. 


Search  Terms 
2  and  4  and  D 

800  and  1  (AN  is  a  stop  word) 
F  and  15 

TF30  and  P  and  3 

Cobalt  and  Alloyed  and  Silicon 

1  and  000  and  Ton  and  Trucks 
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Appendix  A  (cont.) 

5.  Alphanuraerics ,  Chemical  Terminology,  and  Proper  Names  can  be  searched  as 
text  words. 

EXAMPLES: 

Alphanuraerics  -  M-16,  F-16,  45mm  (The  dash  is  a  special  character  and 

is  treated  as  a  blank.) 

Chemical  Terminology  -  Tetracycline,  Acetylsalicylic  Acid,  H2S04 

(See  Verbalizing  for  Machinability  Rules, 
Attachment  2.) 

Proper  Names  -  Laplace  Transform,  Einstein  Equations,  Hodgkins  Disease 
(When  used  in  the  search-  Laplace  and  Transform, 
Einstein  and  Equations,  Hodgkins  and  Disease.) 

6.  Variant  spellings  and  word  forms  should  be  considered  during  search 
strategy  formulation. 

EXAMPLES : 

Spellings  -  Color,  Colour,  Programmer,  Programer 
Forms  -  Mine,  Mines,  Mined,  Mining 

7.  Synonyms  should  be  considered  during  search  strategy  formulation. 

EXAMPLE : 

Marijuana,  Marihuana,  Pot,  Grass,  Weed,  Mary  Jane 

8.  Foreign  spellings  should  be  considered  during  search  strategy 
formulation. 

EXAMPLE : 

Radiatsii  (Russian),  Radiation  (French),  Strahlung  (German), 

Strating  (Dutch) 

9.  Common  misspellings  should  be  considered  during  a  comprehensive  search 
strategy  formulation. 

EXAMPLE : 

Retrieval ,  Retreival 
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10.  Text  word  searching  may  be  used  in  conjunction  with  Text  Title  and 
Narrative  searching.  The  text  word  search  narrows  the  search  to  summaries 
which  contain  the  specific  words  to  be  searched. 


EXAMPLE:  A  search  for  summaries  on  Solar  Collectors 


Inverted  File  Search  for  Specific  Terms 


@SWU@ 

760S0LAR 

AND 

760C0LLECT0R 

760C0LLECT0RS 

END 


-  Data  Base  Command 

-  Text  Word  Role  Code  and 

-  Boolean  operator 

-  Text  Word  Role  Code  and 

-  Text  Word  Role  Code  and 

-  Search  terminator 


Term 

Term 

Term 


Text  Search  (Qualification) 


@SRTAB@ 

SOLAR  COLLECTOR 
SOLAR  COLLECTORS 
END 


-  Scan  Title  and  Narrative  Command 

-  Text  string  to  be  searched. 

-  Text  string  to  be  searched. 

-  Search  terminator 


To  display  the  results,  use  the  Display  Qualified  Results  Command,  @DQR@. 


j 

.1 


Appendix  A  (cont. ) 

VERBALIZING  FOR  MACHINABILITY 

The  following  symbols  may  be  used  in  abstracts,  annotations  and  titles: 

•  ♦  :  ;’/*$%()  -+=<>&? 

HMtH  f  t  +tt  t 

(Arrows  indicate  upper  and  lower  shift  characters.) 

The  following  symbols  may  be  used  in  indexing  terms: 

FRACTIONS 

Use  the  slash  ( virgule)  for  the  fraction  bar ;  e.g., 

x=iL^  use  x=(a-b)/c 
\  c 

x  =  a-  use  x=a  -  (b/c) 


■  /  (  ) 


ACCENTS/DIACRITICAL  MARKS 

Omit  except  in  the  following  cases: 
a  is  replaced  by  ae 
o  is  replaced  by  oe 
ii  is  replaced  by  ue 
■f>  is  replaced  by  oe 


Germanic  languages 


ANGSTROM  UNITS  (A) 

Use  A 

CHEMICALS 

ll3S04  use  H2S()4 
->  use  yields 

Sec  also  PRIMES.  SUBSCRIPTS.  SUPERSCRIPTS 
CUBIC 

cm3  use  cii  cm  or  cc 
ft3  use  cu  ft 
m3  use  cu  m 


GREATER  THAN  OR  EQUAL  TO  (  _>  ) 

Use  >  or  = 

GREEK  LETTERS 

Use  their  names;  e.g., 
oc  use  alpha 
3  use  beta 

y  use  mu,  micro,  or  micron  as  applicable 
tt  pi 

See  also  SPECIAL  SYMBOLS 


1 


See  also 


use  I  /( x  cubed ) 

EXPONENTS 


ITALICS 

Do  not  use ;  see  also  UNDERSCORING 

LESS  THAN  OR  EQUAL  TO  (<.  ) 

Use  <  or  = 


DEGREES 

Angles 

Latitude  (Longitude: 
Temperautre: 


60°  30’ 1 5”  use  oO  deg  30  min  15  sec 
Add  N,  S,  E,  or  W  at  end  of  expression. 
60°30M5”  N  use  60  deg  30  min  15  sec  N 
60°  C  use  60  C 
60°  F  use  60  F 
60°  K  use  60  K 


DIACRITICAL  MARKS 

See  ACCENTS/DIACRITIC  At  MARKS 

EXPONENTS 

x(n*i)  m.  x  lo  (be  (n-l )  power 
ft  sec  1  use  1 1  /sec 

When  the  cx/xment  is  less  than  7  and  has 

the  base  10.  write  <»ut  the  number ;  e.g., 

t  i)2  ttsc  1 00 

K)  *1  use  0.0001 

2.75  x  10  n  use  0.00275 

When  the  base  is  10  and  the  exponent  is  7 

or  more,  write  out,  e.g., 

I07  use  10  in  (he  7th  power 
10  9  use  10  to  the  minus  *>th  power 
See  also  CUBIC.  SQUARE,  SUPERSCRIPTS 


LOGARITHMS 

l°g  10  U*e  log 

log,  use  In  (alpha  *7”  not  digit  “l”) 

MICRO-  or  MICROM1CRO- 

p  1  use  microliters 
u  8  use  microsecs 
yv  use  microvolts 

y  U  f  use  micromicrofarads  or  picofarads 

MICRONS 

y  use  micron(s)  when  applicable 
my  use  millimicron(s) 
um  use  rriicrometer(s)e>r  micron(s) 
y  y  use  micromicron(s)  or  picometer(s) 
um  use  micrometer(s)  or  micron(s)  when  applicable 
PLUS  OR  MINUS  (  ±  ) 

Use  +  or  - 

PRIMES  O 

Use  repeating  apostrophe 

2%2\2"  -terpy ridyl  use  2,2\2"  -terpyridyl 
a’”(  a  triple  prime)  use  a'  *  ’ 


-16- 


Atch  2 
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QUOTATION  MARKS  D 

Use  the  a  f* ‘Strop  he  or  single  quote  only. 

Where  quotation  marks  arc  comcntionallv  use.J  as  a  symbol, 
abbreviate;  c.g.. 
i _  ”  use  1  3  in. 

5**/54  cal  use  5-m./54-eal  guns 
45“  use  45  <ec 
See  also  DEGRl  ES,  PRIMES 


SPECIAL  SYMBOLS 

**  l  use  aj >pio\. 

^  i 

j  mao  \  ic  ids  (ehennstryj 
|  use  .  i  p  p  r  o  a  c !  i  c  s  I  i  m  i  t  '  4  /  ma  i  h  <  mat  it  w ) 

K  ;/ve  K 

//  mac  nuiuK  , 

»•>  /JSc  infinity 

j  n\'  vv.iwli’n^th  ieleetronirs  and  (ihysies/ 

|  wye  f .  I : ! ;  I;  J  a  'all  >'her/ 

itsc  oh  1  n  (electricity  and  electr  mics ) 
use  omega  (all  or  her  I 

q  |  use  phase  (electricity  and  electronics} 

\  use  phi  fall  other) 

Siniilarlv .  spell  out  or  show  hv  acceptable  alphanumeric  characters 
iiu  icnient.  varies  as,  therefore,  differential  ol,  variation  of,  integral, 
sum,  hen/enc  mig,  thunderstorm,  male  fenule.  fixed  stai.  etc. 


SUBSCRIPTS 

PH  use  P  sub  H 
use  V  sub  2 

Bs  use  H  (omit  the  5,  which  is  the  atomic  number  of  boron ) 
C  use  (14  if  it  is  the  isotope  of  carbon 
Set  also  CHEMICALS,  SUPERSCRIPTS 


SUPERSCRIPTS 
34(  use  (14 

H*  use  H  (+) 

SO  4  use  S04  ( -) 

Vs*  use  V  (5+) 

lj?34  use  U234 

BJO  use  BIO 

0lfi(p,n)  N15  018(pji)N15 

4^3  itsc  density  at  23  deg  F  referred  to  water  at  25  deg  F 

n20  use  index  of  refraction  for  20  deg  F  and  sodium  light 

See  also  CUBIC.  EXPONENTS,  SQUARE 


UMLAUT  see  ACCENT 


UNDERSCORING 

Do  not  use  underscoring. 

Escherichia  coli  use  Escherichia  coli 
quasar  use  quasar 


SQUARE 

cm2  use  ^q  cm 
ft2  use  sq  ft 
nr  five  q  m 

j/vv  I , (  v  squared) 
Sec  also  EXPONENTS 


SQUARE  ROOT 

-fCb  | 

(a-b)'”  ) 


use  square  root  ot  (a  b) 
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A 

AFTER 

ALSO 

AN 

AND 

ANY 

ARE 

AS 

AT 

AUTHOR 

AVAILABLE 

BE 

BEEN 

BEING 

BETWEEN 

BOTH 

BUT 

BY 

C 

CAN 

CFRD 

CONDUCTED 

CONSIDERED 

COULD 

CRD 

DESCRIBED 

DESCRIBES 

DESIGNED 

DETERMINE 

DETERMINED 

DIFFERENT 

DISCUSSED 

DUE 

DURING 

EACH 

FOR 

FOUND 

FROM 

FURTHER 


STOP  WORD  LIST 


HAS 

HAVE 

HOWEVER 

IF 

IN 

INCLUDED 

INTO 

INVESTIGATED 

IS 

IT 

ITS 

MADE 

MAY 

MORE 

MOST 

NO 

NOT 

OBTAINED 

OF 

ON 

ONLY 

OR 

OTHER 

OUT 

PERFORMED 

POSSIBLE 

PRESENT 

PRESENTED 

PRESENTS 

PROVIDE 

PROVIDED 

PROVIDES 

RELATED 

REPORT 

REQUIRED 

RESULTS 

S 

SEE 

SELECTED 

SEVERAL 


SFRD 

SHOULD 

SHOWN 

SIGNIFICANT 

SOME 

SRD 

STUDIES 

SUCH 

TESTED 

THAN 

THAT 

THE 

THEIR 

THERE 

THESE 

THEY 

THIS 

THOSE 

THROUGH 

TO 

TYPES 

U 

UNDER 

UP 

USE 

USED 

USING 

VARIOUS 

VERY 

WAS 

WERE 

WELL 

WHEN 

WHERE 

WHICH 

WHILE 

WILL 

WITH 

WITHIN 

WITHOUT 

WOULD 


GENERAL 

GIVEN 
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SEARCH  HISTORY 


Search  Date 

Color  Code 

Version 

April  22,  1981 

Orange 

Normal 

Purple 

Normal 

April  29,  1981 

Red 

Test 

Yellow 

Test 

Green 

Test 

Avocado 

Test 

Black 

Test 

Brown 

Test 

May  13,  1981 

Pi  nk 

Production 

Light  Blue 

Production 

Silver 

Production 

Rust 

Production 

May  14,  1981 

Peach 

Production 

Lavender 

Production 

Green/brown 

Production 

Jun  23,  1981 

Whi  te 

Production 

Normal  = 
Test  = 
Production  = 


Online  search  prior  to  free  text  implementation. 

Test  online  system  without  normal  user  background. 

Online  search  after  free  text  implementation  with  normal 
user  background. 


-18- 


Appendix  D 


January  1981 


TEXT  WORD  INVERTED  FILE  SIZE  PROJECTIONS 


About  15,000,000  words  or  the  TR  file  were  processed.  This  generated 
an  inverted  file  of  3,960  tracks  (containing  only  text  words).  The 
following  predictions  are  made  for  OTIC  data  bases  based  on  the  TR 
sampl e. 


TR 

+27,000 

tracks 

wins 

+  6,000 

tracks 

RDPP 

+  1,150 

tracks 

IRD 

+  1,350 

tracks 

Note:  One  disk  drive  contains  about  19,000  tracks. 

These  figures  are  based  on  data  collected  during  the  summer  of 
1980. 


i 

I 


! 


B  BANT) 

CHROMATOGRAPHIC  ANALYSES 
GAMMA  RAY  SCATTERING 

naval  personnel 
SOLAR  DISTURBANCES 
ARMY  AIRCRAFT 
RADAR  CAMOUFLAGE 
F-l  5 

AIR  FILTERS 

AIR  TO  SURFACE  AND  MISSILES 
YAWMETERS 

COLDWEATHER  AND  CLOTH 
SALT  WATER 

UNDERWATER  NAVIGATION 
TWO  DIMENSIONAL  FLOW 
ARTIFICIAL  INTELLIGENCE 
SUPERCONDUCT  AND  THIN  FILMS 
PACKET  AND  SWITCHING 
AERODYNAMIC  CONFIGURATIONS 
FUEL  TANKS 
ULTRASONIC  WELDING 
TURBOCHARGERS 
SONAP  SIGNALS 
RESURPINE 
SULFURIC  ACID 
PHOTOSENSITIVITY 
AUTOMATIC  WEAPONS 
FUEL  CONSUMPTION 
CHEMICAL  WARFARE 

TOTAL 

AVERAGE 


N  =  Normal  online  search  prior  to  free  text 
T  *  Test  system  search  with  free  text  terms 
P  =  jormal  online  search  with  free  text  terms 
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APPENDIX  I 


FREE  TEXT  RELEVANCE 


B  BAND 

R 

MR 

NR 

MATCH 

COMMENTS 

DA0E8756 

* 

DF091560 

* 

0  =  R 

DF098560 

* 

2  =  MR 

DF102060 

* 

8  =  NR 

DF105090 

* 

0  =  Match 

DF105660 

* 

DF106120 

* 

Many  false  drops  on  this  search 

DF109340 

* 

because  of  paragraph  or  point 

DF110380 

* 

headers.  For  example:  (b)  or  B. 

DF425401 

* 

SALT  WATER 

R 

MR 

NR 

MATCH 

COMMENTS 

DN980143 

* 

DA0G21 15 

* 

3  =  R 

DA0G6301 

* 

7  =  HR 

DF320620 

* 

* 

0  =  NR 

DF327510 

* 

* 

2  =  Match 

DF906950 

* 

DN123525 

* 

DN594247 

* 

DN675229 

* 

DN675500 

* 

UNDERWATER  NAVIGAT: 

R 

MR 

NR 

MATCH 

COMMENTS 

DN775448 

* 

DN788021 

* 

3  =  R 

DN088001 

* 

4  =  MR 

DN088018 

* 

* 

3  =  NR 

DN178106 

* 

1  =  Match 

DN188114 

* 

DN223696 

* 

DN888728 

* 

DN989078 

* 

DN996404 

* 

R  =  Relevant  document 

HR  ■  Marginally  relevant  document 

NR  ■  Document  not  relevant  (false  drop) 

Match  *  Document  would  have  been  retrieved  by  descriptor  or  open-ended 
term. 


-24- 


APPENDIX  I  (cont) 


FREE  TEXT  RELEVANCE 


FUEL  TANKS 

R 

WS5 

inr 

BittUai 

COMMENTS 

DF187070 

* 

DF188580 

* 

9  =  R 

DF643510 

* 

* 

1  =  MR 

DN782488 

* 

k 

0  =  NR 

DA0B3658 

* 

6  =  Match 

DA0B4016 

* 

k 

DA0D3912 

* 

* 

DA0E3821 

* 

DA0G0276 

* 

* 

DA0G0277 

* 

* 

SONAR  SIGNALS 

R 

IM 

MATC^ 

COMMENTS 

DN675355 

■ 

H 

DN775335 

J 

I 

10  =  R 

DN775448 

■ 

■ 

1 

0  =  MR 

DN888567 

■ 

■ 

0  =  NR 

DN088009 

|Q 

■ 

■ 

■  ■ 

3  =  Match 

DN089052 

El 

■ 

■ 

DN136527 

1 

DN223092 

Q 

■ 

■ 

WBM 

DN223873 

1 

MB 

DN489151 

■ 

■ 

1  ■  ^ 

AIR  TO  SURFACE  MIS: 

R 

us 

mm 

COMMENTS 

DA0Q4521 

* 

DF108520 

* 

3  =  R 

DF108980 

* 

1  =  MR 

DF109240 

* 

6  =  NR 

DF109340 

* 

0  =  Match 

DF109600 

* 

DF1097 10 

* 

DF110020 

* 

DF111170 

* 

DF1 11640 

* 

R  -  Relevant  document 

MR  »  Marginally  relevant  document 

NR  -  Document  not  relevant  (false  drop) 

Match  *  Document  would  have  been  retrieved  by  descriptor  or  open-ended 


APPENDIX  I  (cont) 


FREE  TEXT  RELEVANCE 


AIR  FILTERS 

R 

MR 

m 

MATCH 

COMMENTS 

DF103300 

* 

DF107560 

* 

0  =  R 

DF108330 

* 

1  =  MR 

DF115120 

* 

9  =  NR 

DF119580 

* 

0  =  Match 

DF301320 

* 

DN220112 

* 

DN678752 

* 

DN713011 

* 

DA0B0758 

* 

i 

F-15 

R 

MR 

NR 

MATCH 

COMMENTS 

DF106240 

* 

DF110240 

* 

5  =  R 

DF110380 

* 

5  =  MR 

DF110980 

* 

0  =  NR 

DF111130 

* 

0  =  Match 

DF111150 

* 

DF111480 

* 

DF111610 

★ 

DF111960 

* 

DF112020 

* 

RADAR  CAMOUFLAGE 

R 

MR 

NR 

MATCH 

COMMENTS 

DF101600 

* 

DF110900 

* 

6  =  R 

DA0F4817 

* 

4  =  MR 

DA0G0236 

* 

0  =  NR 

DA0G0268 

* 

1  =  Match 

DA0G2484 

* 

DAOG4757 

* 

DA0G4795 

* 

DA0G5166 

* 

* 

DA0G9130 

* 

R  *  Relevant  document 

MR  *  Marginally  relevant  document 

NR  *  Document  not  relevant  (false  drop) 

Match  *  Document  would  have  been  retrieved  by  descriptor  or  open-ended 


APPENDIX  (cont) 


FREE  TEXT  RELEVANCE 


CHROMATOGRAPHIC  AN: 

R 

MR 

NR 

MATCH 

COMMENTS 

DN880254 

* 

DA0A6949 

* 

7  =  R 

DAOA7323 

* 

* 

3  =  MR 

DA0B6242 

it 

0  =  NR 

DA0B6536 

* 

1  =  Match 

DAOB6937 

* 

DA0C7456 

* 

DA0D9417 

* 

DA0E6313 

* 

DA0F0088 

* 

FUEL  CONSUMPTION 

R 

1 M 

LN 

[  MATCH 

COMMENTS 

DA0D3220 

* 

■ 

■ 

DAOD3999 

* 

■ 

1 

1 

9  =  R 

DA0D4101 

* 

I 

I 

1  =  MR 

DA0D4106 

* 

■ 

■ 

0  =  NR 

DA0E38 12 

* 

■ 

■ 

4  =  Match 

DA0E3861 

* 

■ 

■ 

■ 

DA0G0501 

* 

I 

1 

mm 

DA0G1072 

9 

I 

DA0G3120 

* 

m 

■ 

DA0G3375 

* 

■ 

■ 

■1 

ARMY  AIRCRAFT 

R 

m 

m 

m hbp 

Britt*!  t 

COMMENTS 

DA0A4996 

* 

DAO BO 7 19 

* 

* 

9  =  R 

DA0B4016 

* 

it 

1  =  MR 

DA0B6893 

* 

0  =  NR 

DA0C2685 

* 

6  =  Match 

DAOC3427 

* 

DA0C3431 

it 

* 

DA0C3439 

* 

* 

DA0C4009 

* 

* 

DA0C6130 

* 

* 

R  ■  Relevant  document 

MR  “  Marginally  relevant  document 

NR  *  Document  not  relevant  (false  drop) 

Match  »  Document  would  have  been  retrieved  by  descriptor  or  open-ended 
t-prm. 


APPENDIX  I  (cont) 


FREE  TEXT  RELEVANCE 


NAVAL  PERSONNEL 

R 

MR 

EB 

COMMENTS 

DA0J4338 

* 

* 

* 

DN023002 

* 

10  =  R 

DN023238 

* 

* 

0  =  MR 

DN023256 

* 

* 

0  =  NR 

DN063002 

* 

* 

8  =  Match 

DN075040 

* 

* 

DN075096 

* 

* 

DN075244 

* 

* 

DN075338 

* 

* 

DN075376 

* 

AUTOMATIC  WEAPONS 

R 

\M 

1  td 

matchI 

COMMENTS 

DA0B0700 

* 

■ 

■ 

DA0B0701 

* 

■ 

H 

■ 

9  =  R 

DA0B0714 

* 

1 

1  *  MR 

DA0B07 15 

* 

0  =  NR 

DA0D4922 

m 

0  =  Match 

DA0F1403 

* 

I 

fj 

■ 

DA0G0297 

* 

DA0G2427 

* 

m 

DA0G5025 

* 

I 

■ 

II 

DA0G6 104 

* 

I 

■ 

PACKET  SWITCHING 

R 

EE 

COMMENTS 

DA0G0858 

* 

DA0G8985 

* 

5  =  R 

DA0G9169 

* 

* 

4  =  MR 

DF051570 

* 

1  =  NR 

DF057420 

* 

1  =  Match 

DF721300 

* 

DF724I10 

* 

DF917470 

* 

DN080028 

* 

DN680068 

* 

R  *  Relevant  document 

MR  *  Marginally  relevant  document 

NR  *  Document  not  relevant  (false  drop) 

Match  *  Document  would  have  been  retrieved  by  descriptor  or  open-ended 


APPENDIX  I  (cont) 


FREE  TEXT  RELEVANCE 


SOLAR  DISTURBANCES 

R 

IB?! 

IB? 

wrmsf 

liaUfcgtt 

COMMENTS 

DF046410 

* 

■ 

■ 

DF047370 

* 

■ 

■ 

7  =  R 

DF053380 

■ 

2  =  MR 

DF053770 

* 

■ 

■ 

1 

1  =  NR 

DF247950 

* 

I 

3  =  Match 

DF248790 

* 

I 

IBB 

DF248840 

* 

H 

DF249000 

* 

DF249260 

1 

D 

■ 

DF249650 

D 

■ 

SULFURIC  ACID 

R 

IBs 

most 

COMMENTS 

DA0E9105 

* 

* 

DA0E9204 

* 

0  =  R 

DA0G0939 

* 

* 

9  =  MR 

DA0G4209 

* 

1  =  NR 

DA0G9065 

* 

2  =  Match 

DA0G9095 

* 

DAOJ7577 

* 

DA0Q7892 

* 

DF043420 

* 

DF180490 

* 

AERODYNAMIC  CONFIG: 

R 

IB? 

NR 

MATCH 

COMMENTS 

DF051110 

* 

DF057210 

* 

10  =  R 

DF120720 

* 

0  =  MR 

DF138610 

* 

* 

0  =  NR 

DF139410 

* 

* 

4  =  Match 

DF139830 

* 

DF139870 

* 

DF140010 

* 

* 

DF143410 

* 

DF143520 

* 

* 

K  =  Relevant  document 

MR  =  Marginally  relevant  document 

NR  *  Document  not  relevant  (false  drop) 

Match  »  Document  would  have  been  retrieved  by  descriptor  or  open-ended 
term. 
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APPENDIX  I  (cont) 


FREE  TEXT  RELEVANCE 


RESURPINE 

R 

Iff? 

iff? 

MATCH 

COMMENTS 

DAOA6427 

* 

DA0A7032 

* 

* 

8  =  R 

DAOA9547 

* 

2  =  MR 

DA0E0086 

* 

0  =  NR 

DA0G6951 

* 

3  =  Match 

DF005720 

* 

DF007420 

* 

* 

DF384905 

* 

DF386717 

* 

* 

DF388520 

* 

TWO  DIMENSIONAL  FL: 

R 

MR 

NR 

MATCH 

COMMENTS 

DA0C5002 

* 

DA0C5004 

* 

0  =  R 

DA0D3235 

* 

5  =  MR 

DA0D3261 

* 

5  =  NR 

DA0D3999 

* 

1  =  Match 

DA0D8613 

* 

DA0D9443 

* 

DA0E9013 

* 

DA0E9022 

* 

* 

DA0E9040 

* 

R  *  Relevant  document 

MR  *  Marginally  relevant  document 

NR  *  Document  not  relevant  (false  drop) 

Match  »  Document  would  have  been  retrieved  by  descriptor  or  open-ended 
term. 
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Appendix  J 


100% 


FREE  TEXT  RELEVANCE  OVERLAID  BY  DESCRIPT 


Free  Text  ^ 

I 

i 

i 


